Size of test dataset: 23
Percentage of successful automatic results: 80%
Percentage of less successful automatic results: 20%